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We present the genome sequences of three hpAfrica2 strains of Helicobacter pylori, which are postulated to have evolved in iso- 
lation for many millennia in people of San ethnicity. Although previously considered to be ancestral to Helicobacter acinonychis, 
the hpAfrica2 strains differ markedly from H. acinonychis in their gene arrangement. These data provide new insights into Heli- 
cobacter evolution. 
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Helicobacter pylori chronically colonizes the stomachs of bil- 
lions of people worldwide, sometimes leading to gastroduo- 
denal disease, although usually without symptoms and sometimes 
with beneficial effects ( 1 ) . H. pylori is very diverse genetically, with 
different sets of genotypes predominating in well-separated hu- 
man populations (2, 3). Two superlineages have been described: 
one including hpNEAfrica, hpAfrical, and all non- African H. py- 
lori strains and one including hpAfrica2, which is postulated to 
have diverged from mainstream H. pylori some 100,000 years ago 
and evolved in isolation in southern African San (hunter-gatherer, 
click-speaking) societies (4). We sequenced the genomes of three 
strains classified by multilocus sequence typing (MLST) as being 
of the unusual hpAfrica2 type, found among 17 strains cultured 
with informed consent from patients from Soweto in Chris Hani 
Baragwanath Hospital, Johannesburg, South Africa, from 1997 to 
1999. Notably, these patients were of Bantu, not San, ethnicity. 

DNAs from single colonies were sequenced using 454 methods 
(read lengths, >400 bp; coverage, >25-fold) and assembled with 
454 Newbler software. All gaps between contigs were filled by PCR 
and capillary sequencing (5), and sequences were annotated at 
NCBI (http://www.ncbi.nlm.nih.gov/genome/annotation_prok/; 
strain SouthAfrica7) or with the Institute for Genome Sciences 
(IGS) Annotation Engine (http://ae.igs.umaryland.edu/cgi/index 
.cgi) (6). The three SAfrica strain genome sizes are 1.6 to 1.7 Mb, 
which is typical for H. pylori. 

A BLASTn analysis of sequential 1-kb fragments genome-wide 
indicated that these SAfrica strains are closely related to one an- 
other, with little if any admixture from non-hpAfrica2 strains. 
nWayComp analysis (7) identified 173 open reading frames 
(ORFs) in Helicobacter acinonychis (GenBank accession no. 
NC_008229.1, postulated to descend from hpAfrica2) (4) that are 
absent in the SAfrica strains and, conversely, 93 ORFs in the SAf- 



rica strains that are absent from H. acinonychis. One gene present 
in all three SAfrica strains (HPSA_05790, encoding the DnaJ-like 
heat shock protein) had no close homolog in the current NCBI 
database, and another (HPSA_06345, hypothetical) was absent 
from all but one other H. pylori strain. Each SAfrica strain lacks the 
cag pathogenicity island and contains a type II hopQ outer mem- 
brane protein gene. The SAfrica strains contain s2/m2-type vacA 
genes (although vacA in SouthAfrica7 is frameshifted). These fea- 
tures are associated with low gastroduodenal disease risk. 

Mauve analysis (8) illustrated that the H. pylori genome orga- 
nization is well conserved, with <10 rearrangements (inversions/ 
translocations, defined as locally colinear blocks; http://gel.ahabs 
.wisc.edu/mauve/) among the SAfrica strains and <20 such 
differences between the SAfrica and non-hpAfrica2 strains, such 
as J99. However, >100 rearrangements distinguished H. aci- 
nonychis from these putatively ancestral hpAfrica2-type strains. 
The SAfrica strains lack all but one H. acinonychis prophage gene 
(a gene also present in many prophage-free H. pylori strains and 
thus probably not of viral origin). 

Collectively, these outcomes heighten interest in the proposed 
ancestry relationships between H. pylori and H. acinonychis (4) 
and more generally between the many helicobacters of humans 
and other mammals. Further analyses of hpAfrica2 genomes 
should provide new insights into important evolutionary ques- 
tions — for example, how bacterial superlineages and distinct spe- 
cies (here, hpAfrica2 from ethnic Bantu Soweto residents, and 
H. acinonychis) arose and diverged from ancestral forms and re- 
mained distinct genome- wide — especially for species such as H. 
pylori, generally considered to enjoy "free recombination" (9). 

Nucleotide sequence accession numbers. The genome se- 
quences of these H. pylori strains have been deposited in GenBank 
under the following accession no.: SouthAfrica7, CP002336; 
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SouthAfrica20, CP006691; and SouthAfrica50, AVNIOOOOOOOO. 
The versions described in this paper are the first versions (South- 
Africa?, CP002336.1; SouthAfrica 20, CP006691.1; and SouthAf- 
rica 50, AVNIO 100000 1 . 1 and AVNIO 1000002. 1 ) . 
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